Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 2173144 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 136288 |
| Duplicate rows (%) | 6.3% |
| Total size in memory | 232.1 MiB |
| Average record size in memory | 112.0 B |
Variable types
| NUM | 8 |
|---|---|
| CAT | 3 |
| BOOL | 3 |
| Dataset has 136288 (6.3%) duplicate rows | Duplicates |
PJ_IDADE has 36122 (1.7%) zeros | Zeros |
Reproduction
| Analysis started | 2020-09-27 18:38:36.221981 |
|---|---|
| Analysis finished | 2020-09-27 18:45:16.405552 |
| Duration | 6 minutes and 40.18 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
CPF
Real number (ℝ≥0)
| Distinct | 1134103 |
|---|---|
| Distinct (%) | 52.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.341521117e+10 |
|---|---|
| Minimum | 1163 |
| Maximum | 9.999999417e+10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 1163 |
|---|---|
| 5-th percentile | 899480179 |
| Q1 | 4723516301 |
| median | 1.495771024e+10 |
| Q3 | 6.37693105e+10 |
| 95-th percentile | 9.236942645e+10 |
| Maximum | 9.999999417e+10 |
| Range | 9.999999301e+10 |
| Interquartile range (IQR) | 5.90457942e+10 |
Descriptive statistics
| Standard deviation | 3.275267265e+10 |
|---|---|
| Coefficient of variation (CV) | 0.9801725474 |
| Kurtosis | -1.162449459 |
| Mean | 3.341521117e+10 |
| Median Absolute Deviation (MAD) | 1.388050776e+10 |
| Skewness | 0.6192915508 |
| Sum | 7.261606566e+16 |
| Variance | 1.072737566e+21 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 596634307 | 249 | < 0.1% | |
| 8.593706135e+10 | 171 | < 0.1% | |
| 3077961326 | 108 | < 0.1% | |
| 5.849690417e+10 | 96 | < 0.1% | |
| 1.816014681e+10 | 55 | < 0.1% | |
| 2.804770265e+10 | 54 | < 0.1% | |
| 9.959913929e+10 | 50 | < 0.1% | |
| 8086283640 | 48 | < 0.1% | |
| 1.372061577e+10 | 48 | < 0.1% | |
| 7.504062073e+10 | 43 | < 0.1% | |
| 9939804733 | 43 | < 0.1% | |
| 6400472380 | 41 | < 0.1% | |
| 6996085705 | 41 | < 0.1% | |
| 7.483854723e+10 | 40 | < 0.1% | |
| 8110434444 | 39 | < 0.1% | |
| 5411242770 | 38 | < 0.1% | |
| 7.654310227e+10 | 38 | < 0.1% | |
| 1.80463648e+10 | 37 | < 0.1% | |
| 7.927082477e+10 | 37 | < 0.1% | |
| 4.263765177e+10 | 37 | < 0.1% | |
| 1047188708 | 36 | < 0.1% | |
| 1.057116476e+10 | 35 | < 0.1% | |
| 5.359502569e+10 | 35 | < 0.1% | |
| 989437310 | 35 | < 0.1% | |
| 8.626575076e+10 | 35 | < 0.1% | |
| Other values (1134078) | 2171655 | 99.9% |
| Value | Count | Frequency (%) | |
| 1163 | 1 | < 0.1% | |
| 1910 | 1 | < 0.1% | |
| 5150 | 1 | < 0.1% | |
| 41203 | 7 | < 0.1% | |
| 44130 | 1 | < 0.1% | |
| 80527 | 1 | < 0.1% | |
| 83623 | 1 | < 0.1% | |
| 85677 | 2 | < 0.1% | |
| 88692 | 1 | < 0.1% | |
| 100226 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9.999999417e+10 | 2 | < 0.1% | |
| 9.999989713e+10 | 1 | < 0.1% | |
| 9.999972277e+10 | 3 | < 0.1% | |
| 9.999932312e+10 | 1 | < 0.1% | |
| 9.99992851e+10 | 2 | < 0.1% | |
| 9.999897127e+10 | 2 | < 0.1% | |
| 9.999891217e+10 | 1 | < 0.1% | |
| 9.999880712e+10 | 3 | < 0.1% | |
| 9.999863737e+10 | 1 | < 0.1% | |
| 9.999862927e+10 | 1 | < 0.1% |
CNPJ
Real number (ℝ≥0)
| Distinct | 1067950 |
|---|---|
| Distinct (%) | 49.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.155042501e+13 |
|---|---|
| Minimum | 455000107 |
| Maximum | 9.7711797e+13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 455000107 |
|---|---|
| 5-th percentile | 4.25222e+12 |
| Q1 | 1.36643455e+13 |
| median | 2.1801166e+13 |
| Q3 | 2.90993245e+13 |
| 95-th percentile | 3.5063644e+13 |
| Maximum | 9.7711797e+13 |
| Range | 9.7711342e+13 |
| Interquartile range (IQR) | 1.5434979e+13 |
Descriptive statistics
| Standard deviation | 1.112461518e+13 |
|---|---|
| Coefficient of variation (CV) | 0.5162132616 |
| Kurtosis | 7.243497182 |
| Mean | 2.155042501e+13 |
| Median Absolute Deviation (MAD) | 7.72612e+12 |
| Skewness | 1.344338816 |
| Sum | 4.68321768e+19 |
| Variance | 1.237570629e+26 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1.379503e+12 | 106 | < 0.1% | |
| 1.0609938e+13 | 103 | < 0.1% | |
| 9.24845e+12 | 100 | < 0.1% | |
| 7.715251e+12 | 92 | < 0.1% | |
| 7.774211e+12 | 91 | < 0.1% | |
| 5.021025e+12 | 91 | < 0.1% | |
| 2.8016077e+13 | 84 | < 0.1% | |
| 9.116860002e+11 | 83 | < 0.1% | |
| 1.5334477e+13 | 83 | < 0.1% | |
| 2.276269e+13 | 82 | < 0.1% | |
| 5.586590002e+11 | 80 | < 0.1% | |
| 1.0014536e+13 | 77 | < 0.1% | |
| 5.605572e+12 | 77 | < 0.1% | |
| 1.3265725e+13 | 76 | < 0.1% | |
| 1.6808908e+13 | 74 | < 0.1% | |
| 1.0627791e+13 | 73 | < 0.1% | |
| 3.265392e+12 | 69 | < 0.1% | |
| 1.8233963e+13 | 68 | < 0.1% | |
| 1.2076338e+13 | 68 | < 0.1% | |
| 1.4092821e+13 | 66 | < 0.1% | |
| 2.9055907e+13 | 66 | < 0.1% | |
| 3.4882134e+13 | 66 | < 0.1% | |
| 1.349609e+12 | 65 | < 0.1% | |
| 1.5427788e+13 | 64 | < 0.1% | |
| 1.342356e+13 | 63 | < 0.1% | |
| Other values (1067925) | 2171177 | 99.9% |
| Value | Count | Frequency (%) | |
| 455000107 | 1 | < 0.1% | |
| 3129000153 | 1 | < 0.1% | |
| 3251000120 | 1 | < 0.1% | |
| 3574000113 | 5 | < 0.1% | |
| 5058000128 | 2 | < 0.1% | |
| 6486000175 | 1 | < 0.1% | |
| 6817000177 | 1 | < 0.1% | |
| 8151000196 | 1 | < 0.1% | |
| 9290000134 | 1 | < 0.1% | |
| 1.04780001e+10 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9.7711797e+13 | 2 | < 0.1% | |
| 9.7554556e+13 | 1 | < 0.1% | |
| 9.7554536e+13 | 2 | < 0.1% | |
| 9.7554451e+13 | 1 | < 0.1% | |
| 9.7554433e+13 | 1 | < 0.1% | |
| 9.7554425e+13 | 1 | < 0.1% | |
| 9.7554233e+13 | 2 | < 0.1% | |
| 9.7554202e+13 | 7 | < 0.1% | |
| 9.7554128e+13 | 2 | < 0.1% | |
| 9.7554083e+13 | 1 | < 0.1% |
PF_GENERO
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 MiB |
| 1 | |
|---|---|
| 0 |
| Value | Count | Frequency (%) | |
| 1 | 1089970 | 50.2% | |
| 0 | 1083174 | 49.8% |
PF_IDADE
Real number (ℝ≥0)
| Distinct | 109 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.1654345 |
|---|---|
| Minimum | 1 |
| Maximum | 121 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 33 |
| median | 41 |
| Q3 | 50 |
| 95-th percentile | 62 |
| Maximum | 121 |
| Range | 120 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 11.54433315 |
|---|---|
| Coefficient of variation (CV) | 0.2737866521 |
| Kurtosis | -0.2961243606 |
| Mean | 42.1654345 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.3683152041 |
| Sum | 91631561 |
| Variance | 133.2716278 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 38 | 75617 | 3.5% | |
| 39 | 74594 | 3.4% | |
| 37 | 71047 | 3.3% | |
| 40 | 70291 | 3.2% | |
| 35 | 69958 | 3.2% | |
| 41 | 69334 | 3.2% | |
| 34 | 68277 | 3.1% | |
| 36 | 67803 | 3.1% | |
| 33 | 66579 | 3.1% | |
| 42 | 66068 | 3.0% | |
| 43 | 64518 | 3.0% | |
| 32 | 64391 | 3.0% | |
| 44 | 62032 | 2.9% | |
| 31 | 59133 | 2.7% | |
| 45 | 58581 | 2.7% | |
| 46 | 57301 | 2.6% | |
| 47 | 54235 | 2.5% | |
| 48 | 54114 | 2.5% | |
| 30 | 52775 | 2.4% | |
| 49 | 52511 | 2.4% | |
| 50 | 51404 | 2.4% | |
| 29 | 50496 | 2.3% | |
| 51 | 47938 | 2.2% | |
| 52 | 47200 | 2.2% | |
| 28 | 45742 | 2.1% | |
| Other values (84) | 651205 | 30.0% |
| Value | Count | Frequency (%) | |
| 1 | 30 | < 0.1% | |
| 2 | 60 | < 0.1% | |
| 3 | 78 | < 0.1% | |
| 4 | 37 | < 0.1% | |
| 5 | 27 | < 0.1% | |
| 6 | 49 | < 0.1% | |
| 7 | 25 | < 0.1% | |
| 8 | 39 | < 0.1% | |
| 9 | 56 | < 0.1% | |
| 10 | 44 | < 0.1% |
| Value | Count | Frequency (%) | |
| 121 | 7 | < 0.1% | |
| 120 | 4 | < 0.1% | |
| 116 | 2 | < 0.1% | |
| 110 | 5 | < 0.1% | |
| 109 | 9 | < 0.1% | |
| 104 | 4 | < 0.1% | |
| 103 | 1 | < 0.1% | |
| 102 | 2 | < 0.1% | |
| 101 | 8 | < 0.1% | |
| 100 | 9 | < 0.1% |
PJ_PORTE
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 |
| Value | Count | Frequency (%) | |
| 1 | 1315634 | 60.5% | |
| 2 | 652653 | 30.0% | |
| 3 | 204857 | 9.4% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 1315634 | 60.5% | |
| 2 | 652653 | 30.0% | |
| 3 | 204857 | 9.4% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 2173144 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 1315634 | 60.5% | |
| 2 | 652653 | 30.0% | |
| 3 | 204857 | 9.4% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 2173144 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 1315634 | 60.5% | |
| 2 | 652653 | 30.0% | |
| 3 | 204857 | 9.4% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2173144 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 1315634 | 60.5% | |
| 2 | 652653 | 30.0% | |
| 3 | 204857 | 9.4% |
PJ_SETOR
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | |
| 4 | 7265 |
| Value | Count | Frequency (%) | |
| 1 | 940949 | 43.3% | |
| 2 | 873946 | 40.2% | |
| 3 | 350984 | 16.2% | |
| 4 | 7265 | 0.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 940949 | 43.3% | |
| 2 | 873946 | 40.2% | |
| 3 | 350984 | 16.2% | |
| 4 | 7265 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 2173144 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 940949 | 43.3% | |
| 2 | 873946 | 40.2% | |
| 3 | 350984 | 16.2% | |
| 4 | 7265 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 2173144 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 940949 | 43.3% | |
| 2 | 873946 | 40.2% | |
| 3 | 350984 | 16.2% | |
| 4 | 7265 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2173144 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 940949 | 43.3% | |
| 2 | 873946 | 40.2% | |
| 3 | 350984 | 16.2% | |
| 4 | 7265 | 0.3% |
| Distinct | 71 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.947120393 |
|---|---|
| Minimum | 0 |
| Maximum | 89 |
| Zeros | 36122 |
| Zeros (%) | 1.7% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 10 |
| 95-th percentile | 25 |
| Maximum | 89 |
| Range | 89 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 7.628429845 |
|---|---|
| Coefficient of variation (CV) | 0.9598986133 |
| Kurtosis | 5.516000662 |
| Mean | 7.947120393 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 2.116795204 |
| Sum | 17270237 |
| Variance | 58.1929419 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2 | 257388 | 11.8% | |
| 3 | 199669 | 9.2% | |
| 1 | 182764 | 8.4% | |
| 5 | 182341 | 8.4% | |
| 4 | 180015 | 8.3% | |
| 6 | 153586 | 7.1% | |
| 7 | 148616 | 6.8% | |
| 8 | 133965 | 6.2% | |
| 10 | 130896 | 6.0% | |
| 9 | 128785 | 5.9% | |
| 11 | 45518 | 2.1% | |
| 12 | 36152 | 1.7% | |
| 0 | 36122 | 1.7% | |
| 13 | 31245 | 1.4% | |
| 14 | 25408 | 1.2% | |
| 15 | 24559 | 1.1% | |
| 16 | 22757 | 1.0% | |
| 18 | 20203 | 0.9% | |
| 17 | 20192 | 0.9% | |
| 19 | 19638 | 0.9% | |
| 20 | 18205 | 0.8% | |
| 21 | 18132 | 0.8% | |
| 23 | 15903 | 0.7% | |
| 22 | 15447 | 0.7% | |
| 24 | 14026 | 0.6% | |
| Other values (46) | 111612 | 5.1% |
| Value | Count | Frequency (%) | |
| 0 | 36122 | 1.7% | |
| 1 | 182764 | 8.4% | |
| 2 | 257388 | 11.8% | |
| 3 | 199669 | 9.2% | |
| 4 | 180015 | 8.3% | |
| 5 | 182341 | 8.4% | |
| 6 | 153586 | 7.1% | |
| 7 | 148616 | 6.8% | |
| 8 | 133965 | 6.2% | |
| 9 | 128785 | 5.9% |
| Value | Count | Frequency (%) | |
| 89 | 3 | < 0.1% | |
| 79 | 1 | < 0.1% | |
| 72 | 2 | < 0.1% | |
| 71 | 2 | < 0.1% | |
| 70 | 2 | < 0.1% | |
| 69 | 2 | < 0.1% | |
| 68 | 3 | < 0.1% | |
| 64 | 2 | < 0.1% | |
| 62 | 4 | < 0.1% | |
| 61 | 13 | < 0.1% |
PJ_NUM_FUNCIONARIOS
Real number (ℝ≥0)
| Distinct | 101 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.688102583 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 554 |
| Zeros (%) | < 0.1% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 10 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 5.636453059 |
|---|---|
| Coefficient of variation (CV) | 2.096814718 |
| Kurtosis | 86.35112829 |
| Mean | 2.688102583 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.672872698 |
| Sum | 5841634 |
| Variance | 31.76960308 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 1567997 | 72.2% | |
| 2 | 190972 | 8.8% | |
| 3 | 82394 | 3.8% | |
| 5 | 64104 | 2.9% | |
| 4 | 56858 | 2.6% | |
| 10 | 31983 | 1.5% | |
| 6 | 30782 | 1.4% | |
| 8 | 21205 | 1.0% | |
| 7 | 18893 | 0.9% | |
| 20 | 13525 | 0.6% | |
| 9 | 11924 | 0.5% | |
| 12 | 10956 | 0.5% | |
| 15 | 9703 | 0.4% | |
| 11 | 6201 | 0.3% | |
| 13 | 5181 | 0.2% | |
| 14 | 4747 | 0.2% | |
| 30 | 3998 | 0.2% | |
| 16 | 3714 | 0.2% | |
| 19 | 3652 | 0.2% | |
| 18 | 3472 | 0.2% | |
| 17 | 2603 | 0.1% | |
| 25 | 2492 | 0.1% | |
| 22 | 2120 | 0.1% | |
| 21 | 1590 | 0.1% | |
| 40 | 1569 | 0.1% | |
| Other values (76) | 20509 | 0.9% |
| Value | Count | Frequency (%) | |
| 0 | 554 | < 0.1% | |
| 1 | 1567997 | 72.2% | |
| 2 | 190972 | 8.8% | |
| 3 | 82394 | 3.8% | |
| 4 | 56858 | 2.6% | |
| 5 | 64104 | 2.9% | |
| 6 | 30782 | 1.4% | |
| 7 | 18893 | 0.9% | |
| 8 | 21205 | 1.0% | |
| 9 | 11924 | 0.5% |
| Value | Count | Frequency (%) | |
| 100 | 678 | < 0.1% | |
| 99 | 84 | < 0.1% | |
| 98 | 35 | < 0.1% | |
| 97 | 36 | < 0.1% | |
| 96 | 27 | < 0.1% | |
| 95 | 27 | < 0.1% | |
| 94 | 5 | < 0.1% | |
| 93 | 28 | < 0.1% | |
| 92 | 57 | < 0.1% | |
| 91 | 12 | < 0.1% |
CANAL_ATENDIMENTO
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.689896758 |
|---|---|
| Minimum | 1 |
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.189526389 |
|---|---|
| Coefficient of variation (CV) | 0.7039047701 |
| Kurtosis | 3.031742746 |
| Mean | 1.689896758 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.924091031 |
| Sum | 3672389 |
| Variance | 1.41497303 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 1412289 | 65.0% | |
| 2 | 395743 | 18.2% | |
| 3 | 147430 | 6.8% | |
| 4 | 99831 | 4.6% | |
| 5 | 80106 | 3.7% | |
| 6 | 37745 | 1.7% |
| Value | Count | Frequency (%) | |
| 1 | 1412289 | 65.0% | |
| 2 | 395743 | 18.2% | |
| 3 | 147430 | 6.8% | |
| 4 | 99831 | 4.6% | |
| 5 | 80106 | 3.7% | |
| 6 | 37745 | 1.7% |
| Value | Count | Frequency (%) | |
| 6 | 37745 | 1.7% | |
| 5 | 80106 | 3.7% | |
| 4 | 99831 | 4.6% | |
| 3 | 147430 | 6.8% | |
| 2 | 395743 | 18.2% | |
| 1 | 1412289 | 65.0% |
TEMA_ATENDIMENTO
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.783945288 |
|---|---|
| Minimum | 1 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.186356353 |
|---|---|
| Coefficient of variation (CV) | 0.577798088 |
| Kurtosis | -0.4163584417 |
| Mean | 3.783945288 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.3715626512 |
| Sum | 8223058 |
| Variance | 4.780154101 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 4 | 624913 | 28.8% | |
| 1 | 553769 | 25.5% | |
| 5 | 460369 | 21.2% | |
| 2 | 173584 | 8.0% | |
| 7 | 134489 | 6.2% | |
| 9 | 77179 | 3.6% | |
| 8 | 62399 | 2.9% | |
| 3 | 44418 | 2.0% | |
| 6 | 42024 | 1.9% |
| Value | Count | Frequency (%) | |
| 1 | 553769 | 25.5% | |
| 2 | 173584 | 8.0% | |
| 3 | 44418 | 2.0% | |
| 4 | 624913 | 28.8% | |
| 5 | 460369 | 21.2% | |
| 6 | 42024 | 1.9% | |
| 7 | 134489 | 6.2% | |
| 8 | 62399 | 2.9% | |
| 9 | 77179 | 3.6% |
| Value | Count | Frequency (%) | |
| 9 | 77179 | 3.6% | |
| 8 | 62399 | 2.9% | |
| 7 | 134489 | 6.2% | |
| 6 | 42024 | 1.9% | |
| 5 | 460369 | 21.2% | |
| 4 | 624913 | 28.8% | |
| 3 | 44418 | 2.0% | |
| 2 | 173584 | 8.0% | |
| 1 | 553769 | 25.5% |
ABORDAGEM_ATENDIMENTO
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 MiB |
| 0 | |
|---|---|
| 1 | 78643 |
| Value | Count | Frequency (%) | |
| 0 | 2094501 | 96.4% | |
| 1 | 78643 | 3.6% |
CATEGORIA_ATENDIMENTO
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 MiB |
| 0 | |
|---|---|
| 2 |
| Value | Count | Frequency (%) | |
| 0 | 1190005 | 54.8% | |
| 2 | 983139 | 45.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 1190005 | 54.8% | |
| 2 | 983139 | 45.2% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 2173144 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 1190005 | 54.8% | |
| 2 | 983139 | 45.2% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 2173144 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 1190005 | 54.8% | |
| 2 | 983139 | 45.2% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2173144 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 1190005 | 54.8% | |
| 2 | 983139 | 45.2% |
INSTRUMENTO_ATENDIMENTO
Real number (ℝ≥0)
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.582195197 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8780217398 |
|---|---|
| Coefficient of variation (CV) | 0.5549389489 |
| Kurtosis | 1.34748415 |
| Mean | 1.582195197 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.405124792 |
| Sum | 3438338 |
| Variance | 0.7709221756 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 1382480 | 63.6% | |
| 2 | 394381 | 18.1% | |
| 3 | 338619 | 15.6% | |
| 4 | 37081 | 1.7% | |
| 5 | 20583 | 0.9% |
| Value | Count | Frequency (%) | |
| 1 | 1382480 | 63.6% | |
| 2 | 394381 | 18.1% | |
| 3 | 338619 | 15.6% | |
| 4 | 37081 | 1.7% | |
| 5 | 20583 | 0.9% |
| Value | Count | Frequency (%) | |
| 5 | 20583 | 0.9% | |
| 4 | 37081 | 1.7% | |
| 3 | 338619 | 15.6% | |
| 2 | 394381 | 18.1% | |
| 1 | 1382480 | 63.6% |
MEIO_ATENDIMENTO
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.6 MiB |
| 0 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 0 | 1935443 | 89.1% | |
| 1 | 237701 | 10.9% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| CPF | CNPJ | PF_GENERO | PF_IDADE | PJ_PORTE | PJ_SETOR | PJ_IDADE | PJ_NUM_FUNCIONARIOS | CANAL_ATENDIMENTO | TEMA_ATENDIMENTO | ABORDAGEM_ATENDIMENTO | CATEGORIA_ATENDIMENTO | INSTRUMENTO_ATENDIMENTO | MEIO_ATENDIMENTO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2.621927e+09 | 1.275835e+13 | 1 | 33 | 1 | 3 | 10 | 1 | 2 | 5 | 0 | 2 | 1 | 1 |
| 1 | 7.664574e+10 | 3.391192e+13 | 1 | 39 | 1 | 2 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0 |
| 2 | 3.122399e+10 | 2.724528e+13 | 1 | 35 | 2 | 2 | 3 | 1 | 2 | 7 | 0 | 2 | 1 | 0 |
| 3 | 1.926499e+10 | 3.353785e+13 | 0 | 63 | 1 | 4 | 1 | 1 | 1 | 4 | 0 | 0 | 1 | 0 |
| 4 | 9.267746e+10 | 2.838789e+13 | 0 | 43 | 1 | 2 | 3 | 2 | 1 | 1 | 0 | 0 | 1 | 0 |
| 5 | 1.078980e+09 | 3.592992e+13 | 0 | 25 | 1 | 2 | 0 | 1 | 5 | 1 | 0 | 2 | 1 | 0 |
| 6 | 8.455372e+09 | 3.346415e+13 | 1 | 26 | 2 | 3 | 1 | 6 | 1 | 5 | 0 | 0 | 3 | 0 |
| 7 | 1.051262e+09 | 2.749335e+13 | 0 | 32 | 1 | 2 | 3 | 1 | 1 | 2 | 0 | 2 | 1 | 1 |
| 8 | 1.162869e+09 | 2.312323e+13 | 0 | 50 | 1 | 1 | 5 | 1 | 1 | 6 | 0 | 2 | 3 | 0 |
| 9 | 1.277835e+10 | 1.071304e+13 | 0 | 25 | 2 | 2 | 11 | 8 | 1 | 1 | 0 | 2 | 1 | 0 |
Last rows
| CPF | CNPJ | PF_GENERO | PF_IDADE | PJ_PORTE | PJ_SETOR | PJ_IDADE | PJ_NUM_FUNCIONARIOS | CANAL_ATENDIMENTO | TEMA_ATENDIMENTO | ABORDAGEM_ATENDIMENTO | CATEGORIA_ATENDIMENTO | INSTRUMENTO_ATENDIMENTO | MEIO_ATENDIMENTO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2173134 | 1.830029e+10 | 2.319734e+13 | 1 | 60 | 1 | 1 | 5 | 1 | 1 | 2 | 0 | 2 | 1 | 0 |
| 2173135 | 4.280041e+09 | 8.248686e+12 | 1 | 27 | 3 | 1 | 14 | 6 | 1 | 4 | 0 | 2 | 3 | 0 |
| 2173136 | 5.132182e+10 | 1.536300e+13 | 1 | 56 | 1 | 1 | 8 | 1 | 1 | 5 | 0 | 0 | 2 | 0 |
| 2173137 | 9.085473e+10 | 3.137770e+13 | 0 | 47 | 1 | 2 | 2 | 1 | 1 | 4 | 0 | 0 | 1 | 0 |
| 2173138 | 8.977295e+10 | 2.011362e+13 | 0 | 53 | 1 | 2 | 6 | 1 | 3 | 5 | 0 | 0 | 2 | 0 |
| 2173139 | 3.917310e+10 | 1.076769e+13 | 1 | 49 | 3 | 1 | 11 | 1 | 1 | 4 | 0 | 2 | 1 | 0 |
| 2173140 | 2.957830e+10 | 2.462614e+13 | 0 | 64 | 3 | 3 | 4 | 2 | 1 | 8 | 0 | 0 | 1 | 0 |
| 2173141 | 5.244178e+09 | 2.697920e+13 | 1 | 38 | 1 | 2 | 3 | 1 | 1 | 4 | 0 | 0 | 1 | 0 |
| 2173142 | 1.219942e+10 | 1.665197e+13 | 1 | 22 | 3 | 1 | 8 | 1 | 3 | 6 | 0 | 0 | 2 | 0 |
| 2173143 | 6.743369e+10 | 1.175931e+13 | 0 | 42 | 2 | 3 | 10 | 3 | 5 | 4 | 0 | 0 | 3 | 0 |
Most frequent
| CPF | CNPJ | PF_GENERO | PF_IDADE | PJ_PORTE | PJ_SETOR | PJ_IDADE | PJ_NUM_FUNCIONARIOS | CANAL_ATENDIMENTO | TEMA_ATENDIMENTO | ABORDAGEM_ATENDIMENTO | CATEGORIA_ATENDIMENTO | INSTRUMENTO_ATENDIMENTO | MEIO_ATENDIMENTO | count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 41247 | 8.086284e+09 | 1.160811e+13 | 1 | 34 | 3 | 2 | 10 | 20 | 2 | 5 | 0 | 2 | 1 | 0 | 12 |
| 65425 | 2.804770e+10 | 3.062270e+13 | 0 | 62 | 1 | 2 | 2 | 1 | 2 | 5 | 0 | 2 | 1 | 0 | 12 |
| 37197 | 6.996086e+09 | 8.656556e+12 | 0 | 44 | 2 | 1 | 13 | 9 | 2 | 5 | 0 | 2 | 1 | 0 | 11 |
| 59941 | 1.804636e+10 | 3.552998e+13 | 1 | 22 | 1 | 2 | 1 | 1 | 2 | 5 | 0 | 2 | 1 | 0 | 11 |
| 105348 | 8.291265e+10 | 1.352614e+12 | 1 | 54 | 2 | 1 | 24 | 3 | 2 | 5 | 0 | 2 | 1 | 0 | 11 |
| 6966 | 1.084221e+09 | 9.422790e+12 | 1 | 50 | 3 | 2 | 12 | 1 | 2 | 5 | 0 | 2 | 1 | 0 | 10 |
| 11422 | 1.765013e+09 | 3.224078e+13 | 0 | 49 | 2 | 2 | 2 | 1 | 2 | 5 | 0 | 2 | 1 | 0 | 10 |
| 14845 | 2.377626e+09 | 2.671682e+13 | 0 | 46 | 1 | 2 | 4 | 1 | 2 | 5 | 0 | 2 | 1 | 0 | 10 |
| 18685 | 2.995610e+09 | 2.880202e+13 | 1 | 44 | 2 | 1 | 3 | 2 | 2 | 5 | 0 | 2 | 1 | 0 | 10 |
| 43793 | 8.811834e+09 | 3.099636e+13 | 0 | 40 | 2 | 1 | 34 | 2 | 2 | 5 | 0 | 2 | 1 | 0 | 10 |